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THE MAILING DATE OF THIS COMMUNICATION. 
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Any reply received by the Office later than three months after the mailing date of this communication, even if timely filed, may reduce any 
earned patent term adjustment. See 37 CFR 1.704(b). 

Status 

1)S Responsive to (X)mmunication(s) filed on 07 February 2005 . 
2a)l3 This action is FINAL. 2b)n This action is non-final. 

3) n Since this application is in con(jition for allowance except for fomnal matters, prosecution as to the merits is 

closed in accordance with the practice under Ex parte Quayle, 1935 CD. 1 1 , 453 O.G. 213. 

Disposition of Claims 
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4a) Of the above claim(s) is/are withdrawn from consideration. 

5) 0 Claim(s) is/are allowed. 

e)M Claim(s) 1. 6. 7. 9-13. 18, 19. and 21-31 is/are rejected. 
?)□ Claim(s) is/are objected to. 

8) n Claim(s) are subject to restriction and/or election requirement. 

Application Papers 
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10) 13 The drawing(s) filed on 28 March 2001 is/are: a)S accepted or b)n objected to by the Examiner. 

Applicant may not request that any objection to the drawing(s) be held in abeyance. See 37 CFR 185(a). 
Replacement drawing sheet(s) including the con-ection is required if the drawing(s) is objected to. See 37 CFR 1.121(d). 
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application from the International Bureau (PCT Rule 17.2(a)). 
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DETAILED ACTION 
Response to Amendment 

1 . In response to the office action from 1 1/05/2004, the applicant has submitted an 
amendment, filed 2/07/2005, amending Claim 25, while adding Claims 26-3 1 and arguing to 
traverse the art rejection based on the limitation regarding a modification distortion being a 
distortion between a respective unmodified individual synthesis unit and the individual synthesis 
unit after modification (Amendment, Pages 10-11). Applicant's arguments have been fully 
considered, however the previous rejection is maintained (altered only with respect to amended 
claim 25 and added claims 26-31) due to the reasons listed below in the response to arguments. 

2. Based on the amendment to Claim 25, the examiner has withdrawn the previous 
objections directed towards an improper dependent claim. 

Response to Arguments 

3. Applicant's arguments have been fully considered but they are not persuasive for the 
following reasons: 

With respect to Claim 1, the applicant argues that Zinser fails to teach a modification 
distortion being a distortion between a respective unmodified individual synthesis unit and the 
individual synthesis unit after modification (Amendment, Pages 10-11), however the examiner 
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notes that Zinser teaches a means for determining a modification distortion between an 
unmodified speech synthesis candidate (input speech) and a related codebook synthesis unit. 
The "modification" in Zinser is performed by replacing an unmodified speech synthesis 
candidate in the form of input speech with a replaced codebook value for that same candidate 
and then compensating for a pitch difference (distortion) between the two units according to a 
scaling factor (Col 3, Lines 5-37). 

In response to applicant's argument that the references fail to show certain features of 
appUcant's invention, it is noted that the features upon which applicant relies (Le,, arguing that 
the modification process of Zinser detailed above is not the same as that of the present invention 
(Amendment, Pages 10-11), when the specific meaning of ''modification'' and ''distortion*' have 
not been specifically detailed in the aforementioned claims) are not recited in the rejected 
claim(s). Although the claims are interpreted in light of the specification, limitations from the 
specification are not read into the claims. See In re Van Gems, 988 F.2d 1 181, 26 
USPQ2d 1057 (Fed Cir. 1993). 

In the presently claimed invention the claim language does not specify the meaning of 
"modification," thus the teachings of Zinser and Kagoshima would anticipate the claimed 
invention. The examiner points out that an amendment clearly distinguishing what is meant by 
"modification" and how that modification is utilized in determining a "modification distortion" 
(the meaning of distortion) according to the present invention may overcome the prior art of 
record. 

The additional independent claims contain subject matter similar to claim 1, and thus, 
also remain rejected for the above noted reasons. 
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Dependent claims are argued as further limiting rejected base clainos, and thus, also 
remain rejected. 

Claim Objections 

4. Claims 22-23 and 29-30 are objected to because of the following informalities: in Line 
3 of claims 22-23 and 29-30, "the table" should be changed to -a table— in order to correct a 
lack of proper antecedent basis. 

Appropriate correction is required. 

Claim Rejections - 35 USC §103 

5. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set forth in 
section 102 of this title, if the differences between the subject matter sought to be patented and the prior art are 
such that the subject matter as a whole would have been obvious at the time the invention was made to a person 
having ordinary skill in the art to which said subject matter pertains. Patentability shall not be negatived by the 
manner in which the invention was made. 

6. Claims 1, 12, 13, and 24 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Kagoshima et al (UX Patent: 6, 240 J 84) in view of Zinser (U.S. Patent: 4,980,916). 

With respect to Claims 1 and 13, Kagoshima discloses: 

Distortion obtaining means for obtaining a modification distortion between synthesis 
units before and after modification (distortion calculator for determining a distortion between a 
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synthesis speech segment and a training speech segment, Col 13, Lines 58-60, Also, the training 
speech segment is modified with respect to pitch and duration to generate a synthesis speech 
segment. Col 8, Lines 62-66). 

Selection means for selecting synthesis units based on the modification distortion 
obtained by said distortion obtaining means (selecting synthesis units that minimize distortion 
based on a distance comparison between synthesis and training units, Col 2, Lines 58-62); and 

Speech synthesis means for performing speech synthesis based on the synthesis units 
selected by said selection means (speech synthesizer. Fig. 1, Element 15), 

Kagoshima does not teach that the modification is obtained between an unmodified 
individual synthesis unit and the same individual unit after modification, however Zinser teaches 
a pitch error minimizer, which compares a pitch-altered synthesized speech sequence to an input 
or unmodified sequence to determine a distortion (error) (Col 3, Lines 5-37), 

Kagoshima and Zinser are analogous art because they are from a similar field of endeavor 
in speech synthesis. Thus, it would have been obvious to a person of ordinary skill in the art, at 
the time of invention, to modify the teachings of Kagoshima with the means of determining a 
modification error as a difference between modified and unmodified versions of an individual 
synthesis units as taught by Zinser to improve synthesized speech quality for individual speech 
segments by selecting speech candidates for synthesis capable of minimizing a perceptual error 
(Zinser, Col 3, Lines 20-26). 

With respect to Claims 12 and 24, Kagoshima further recites: 

Input means and step for inputting text data (input text, Col 8, Line 10, that would 
inherently be inputted via a text input means)', 
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Language analysis means and step for performing language analysis of the text data 
(language processing of an input text. Col. 15, Lines 41-43)\ and 

Prosody-parameter generation means and step for generating predetermined prosody 
parameters based on a result of analysis of said language analysis means and step (obtaining 
prosody information from language processing, Col 15, Lines 41-43). 

Wherein said distortion obtaining means obtains the modification distortion based on the 
predetermined prosody parameters generated by said prosody parameter generation means 
(distortion calculator for determining a distortion between a synthesis speech segment (training 
segment with added prosody information) and a training speech segment. Col 13, Lines 58-60, 
Also, the training speech segment is modified with respect to pitch and duration to generate a 
synthesis speech segment, Col 8, Lines 62-66, according to prosody information, Fig. 1, Element 
111). 

7. Claims 6 and 18 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Kagoshima et al in view of Zinser, and further in view of Huang et al (U.S. Patent: 5,913,193). 

With respect to Claims 6 and 18, Kagoshima in view of Zinser teaches the speech 
synthesis apparatus and method that utilizes a modification distortion, calculated as the distance 
between an individual synthesis unit before and after modification, in selecting a best speech unit 
for synthesizing speech, as applied to Claims 1 and 13. Kagoshima in view of Zinser does not 
teach obtaining a distortion by adding modification and concatenation distortion, however Huang 
discloses: 
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A speech signal processing apparatus and method, wherein the distortion obtaining means 
uses a value obtained by adding the obtained modification distortion between the synthesis units 
before and after modification and a concatenation distortion (spectral distortion between 
adjacent instances, Col 5, Lines 1-6) generated by concatenating a synthesis unit to another 
synthesis unit (summing the distortions of an instance sequence, Col. P, Lines 44-47). 

Kagoshima, Zinser, and Huang are analogous art because they are from a similar field of 
endeavor in speech synthesis. Thus, it would have been obvious to a person of ordinary skill in 
the art, at the time of invention, to modify the teachings of Kagoshima in view of Zinser with the 
method of summing distortions including a concatenation distortion as taught by Huang to 
further provide more natural synthesized speech by selecting a best synthesis unit dually based 
upon concatenation and modification distortion, thus minimizing distortion due to concatenation 
to create smooth transitions between speech units and modification to ensure natural sounding 
speech in the instance of a prosody change. 

8. Claims 7 and 19 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Kagoshima et al in view of Zinser, in further view of Huang et al, and in yet further view of 
Campbell et al (U.S. Patent: 6,366,883). 

With respect to Claims 7 and 19, Kagoshima in view of Zinser, and further in view of 
Huang teaches the speech synthesis system capable of selecting best speech instances based upon 
a concatenation and modification distortion sum, as applied to Claims 6 and 18. Kagoshima in 
view of Zinser, and further in view of Huang does not teach calculating a distortion as a 
weighted sum of modification and concatenation distortion, however Campbell discloses: 
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A speech signal processing apparatus and method, wherein the distortion obtaining means 
calculates a weighted sum of the modification distortion between the synthesis units before and 
after modification and the concatenation distortion generated by concatenating a synthesis unit to 
another synthesis unit (selecting a speech unit based upon weighted coefficient vectors, Col. 2, 
Lines 37-38). 

Kagoshima, Zinser, Huang, and Campbell are analogous art because they are fi-om a 
similar field of endeavor in speech synthesis. Thus, it would have been obvious to a person of 
ordinary skill in the art, at the time of invention, to modify the teachings of Kagoshima in view 
of Zinser, and further in view of Huang with the method of selecting a speech unit based upon a 
weighted coefficient vector as taught by Campbell to provide a means of minimizing 
concatenation cost expressed through a weighting function and thus providing higher quality and 
audible synthesized speech. 

9. Claims 9 and 21 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Kagoshima et al in view of Zinser, and further in view of Campbell et al. 

With respect to Claims 9 and 21, Kagoshima in view of Zinser teaches the speech 
synthesis apparatus and method that utilizes a modification distortion, calculated as the distance 
between an individual synthesis unit before and after modification, in selecting a best speech unit 
for synthesizing speech, as appHed to Claims 1 and 13. Kagoshima in view of Zinser does not 
specifically suggest calculating modification distortion using a cepstrum distance, however 
Campbell discloses: 
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A speech signal processing apparatus and method, wherein said distortion obtaining 
means calculates the modification distortion using a cepstrum distance (distortion calculation 
based upon prosodic feature parameters calculated from acoustic characteristics of speech units, 
namely, cepstral distance, Col 12, Lines 1-36). 

Kagoshima, Zinser, and Campbell are analogous art because they are from a similar field 
of endeavor in speech synthesis. Thus, it would have been obvious to a person of ordinary skill 
in the art, at the time of invention, to modify the teachings of Kagoshima in view of Zinser with 
the means of calculating distortion through cepstral distance as taught by Campbell to create a 
speech synthesis system in which modification distortion is calculated using cepstral distance, 
since cepstral distance is a specific example of the distance calculation taught by Kagoshima and 
a good way to describe a speech unit. 

10. Claims 10 and 22 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Kagoshima et al in view of Zinser, and further in view of Hon et al (US. Patent: 6,490,563), 

With respect to Claims 10 and 22, Kagoshima in view of Zinser teaches the speech 
synthesis apparatus and method that utilizes a modification distortion in selecting a best speech 
unit for synthesizing speech, as applied to Claims 1 and 13. Kagoshima in view of Zinser does 
not teach the use of a table to determine a distortion, however Hon discloses: 

A speech signal processing apparatus and method, wherein the distortion obtaining means 
includes a table storing distortions, and determines the modification distortion by referring to the 
table (use of a unit inventory that contains speech instances and a decision tree that denotes the 
best speech instances with regard to a joint distortion function consisting of a concatenation and 
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prosody distortion, both of which may be stored in memory, Col 6, Line 58- Col 7, Line 5), 

Kagoshima, Zinser, and Hon are analogous art because they are from a similar field of 
endeavor in speech synthesis. Thus, it would have been obvious to a person of ordinary skill in 
the art, at the time of invention, to modify the teachings of Kagoshima in view of Zinser with the 
use of an inventory and decision tree denoting speech instances with respect to concatenation and 
prosodic distortion in selecting a best speech instance as taught by Hon to create a means of 
saving distortion parameters for instances where similar text inputs exist- a stored distortion in an 
inventory and best instance saved in a decision tree could be looked up easily and be used for 
selecting the best speech instance, thus improving processing speed without degrading speech 
quality. It would also have been obvious to one of ordinary skill in the art, at the time of 
invention, to implement the inventory in a lookup table format, as is well known in the art, so 
that the speech unit with the least distortion could be selected. 

With respect to Claims 11 and 23, Kagoshima in view of Zinser teaches the speech 
synthesis apparatus and method that utilizes a modification distortion, calculated as the distance 
between an individual synthesis unit before and after modification, in selecting a best speech unit 
for synthesizing speech, as applied to Claims 1 and 13. Kagoshima in view of Zinser do not 
teach the use of a table to determine a concatenation distortion, however Hon discloses: 

A speech signal processing apparatus and method, wherein the distortion obtaining means 
includes a table storing distortions, and determines the modification distortion by referring to the 
table (use of a unit inventory that contains speech instances and a decision tree that denotes the 
best speech instances with regard to a joint distortion function consisting of a concatenation and 
prosody distortion, both of which may be stored in memory. Col 6, Line 58- Col 7, Line 5). 
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Kagoshima, Zinser, and Hon are analogous art because they are from a similar field of 
endeavor in speech synthesis. Thus, it would have been obvious to a person of ordinary skill in 
the art, at the time of invention, to modify the teachings of Kagoshima in view of Zinser with the 
use of an inventory and decision tree denoting speech instances with respect to concatenation and 
prosodic distortion in selecting a best speech instance as taught by Hon to create a means of 
saving distortion parameters for instances where similar text inputs exist- a stored distortion in an 
inventory and best instance saved in a decision tree could be looked up easily and be used for 
selecting the best speech instance, thus improving processing speed without degrading speech 
quality. It would also have been obvious to one of ordinary skill in the art, at the time of 
invention, to implement the inventory in a lookup table format, as is well known in the art, so 
that the speech unit with the least distortion could be selected. 

With respect to Claim 25, Kagoshima and Zinser teach the limitations of claims of 1 and 
13. The aforementioned prior art does not specifically suggest method implementation using a 
storage medium, however, Hon discloses: 

A storage medium, capable of being read by a computer, storing a program for executing 
a speech signal processing method (computer readable storage medium containing computer 
instructions for implementing speech synthesis, Col 4, Lines 36-39). 

Kagoshima, Zinser, and Hon are analogous art because they are from a similar field of 
endeavor in speech synthesis. Thus, it would have been obvious to a person of ordinary skill in 
the art, at the time of invention, to modify the teachings of Kagoshima and Zinser with the use of 
a computer readable medium for implementing a speech synthesis method as taught by Hon to 
store a speech processing method on a computer readable medium to increase method 
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compatibility and usability by providing a means for method use with multiple computer 
systems. 

Claim 29 contains subject matter similar to Claim 10, and thus, is rejected for the same 
reasons. 

Claim 30 contains subject matter similar to Claim 11, and thus, is rejected for the same 
reasons. 

Claim 31 contains subject matter similar to Claim 12, and thus, is rejected for the same 
reasons. 

1 1 . Claim 26 is rejected under 35 U.S.C. 103(a) as being unpatentable over Kagoshima et al 
in view of Zinser, in further view of Hon et al, and further in view of Huang et al. 

Claim 26 contains subject matter similar to Claim 6, and thus, is rejected for the same 
reasons. 

12. Claim 27 is rejected under 35 U.S.C. 103(a) as being unpatentable over Kagoshima et al 
in view of Zinser, in further view of Hon et al, further in view of Huang et al, and in yet further 
view of Campbell et al. 

Claim 27 contains subject matter similar to Claim 7, and thus, is rejected for the same 
reasons. 

13. Claim 28 is rejected under 35 U.S.C. 103(a) as being unpatentable over Kagoshima et al 
in view of Zinser, in further view of Hon et al, and further in view of Campbell et al. 
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Claim 28 contains subject matter similar to Claim 9, and thus, is rejected for the same 
reasons. 

Conclusion 

14. THIS ACTION IS MADE FINAL. Applicant is reminded of the extension of time 
policy as set forth in 37 CFR 1 .136(a). 

A shortened statutory period for reply to this final action is set to expire THREE 
MONTHS from the mailing date of this action. In the event a first reply is filed within TWO 
MONTHS of the mailing date of this final action and the advisory action is not mailed until after 
the end of the THREE-MONTH shortened statutory period, then the shortened statutory period 
will expire on the date the advisory action is mailed, and any extension fee pursuant to 37 
CFR 1. 136(a) will be calculated from the mailing date of the advisory action. In no event, 
however, will the statutory period for reply expire later than SIX MONTHS from the mailing 
date of this final action. 

15. The prior art made of record and not relied upon is considered pertinent to applicant's 
disclosure: 

Valenzuela Steude et al (U.S. Patent: 5,097,508)- teaches a difference calculation 
between input speech and a synthesized input speech to select a result having the smallest error. 
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16. Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to James S. Wozniak whose telephone number is (571) 272-7632 
and email is James.Wozniak@uspto.gov. The examiner can normally be reached on Mondays- 
Fridays, 8:30-4:30. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Wayne Young can be reached at (571) 272-7582. The fax/phone number for the 
Technology Center 2600 where this application is assigned is (703) 872-9306. 

Any inquiry of a general nature or relating to the status of this application or proceeding 
should be directed to the technology center receptionist whose telephone number is (703) 306- 



0377. 




James S. Wozniak 
5/19/2005 



